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Abstract —In this paper, we propose a novel low complexity 
scaling strategy of min-sum decoding algorithm for irregular 
LDPC codes. In the proposed method, we generalize our pre¬ 
viously proposed simplified Variable Scaled Min-Sum (SVS-min- 
sum) by replacing the sub-optimal starting value and heuristic 
update for the scaling factor sequence by optimized values. 
Density evolution and Nelder-Mead optimization are used offline, 
prior to the decoding, to obtain the optimal starting point and per 
iteration updating step size for the scaling factor sequence of the 
proposed scaling strategy. The optimization of these parameters 
proves to be of noticeable positive impact on the decoding 
performance. We used different DVB-T2 LDPC codes in our 
simulation. Simulation results show the superior performance 
(in both WER and latency) of the proposed algorithm to other 
Min-Sum based algorithms. In addition to that, generalized SVS- 
min-sum algorithm has very close performance to LLR-SPA with 
much lower complexity. 

I. Introduction 

Low-density parity check codes (LDPC) were introduced 
by Gallager [1] in the early 1960s. Decoding of LDPC codes, 
by log-likelihood ratio sum-product algorithms (LLR-SPA), 
are proven to achieve excellent capacity performance, by 
approaching the Shannon bound [2]. However, the drawbacks 
of LLR-SPA, namely, the high complexity and sensitivity 
to linear scaling, are solved by the Min-Sum algorithm 
[3]. Scaled Min-Sum [4] is a modification of Min-Sum 
algorithm, where a scaling factor is used to decrease the error 
introduced by using the approximate minimum operation. 
Scaled Min-Sum (with constant scaling factor) is suitable for 
regular LDPC codes. However, irregular LDPC codes require 
different scaling technique [5]—[7]. 

In [5], a two-dimensional (2D) correction of the min-sum 
was proposed. In this algorithm, different scaling factors 
are required for different check node degrees and variable 
node degrees. Consequently, the algorithm requires the 
calculation of two scaling factor vectors a and f3 with length 
equal maximum check node’s degree and variable node’s 
degree respectively. These vectors are optimized by parallel 
differential optimization of Density Evolution (DE). 

In [6], different scaling factor per iteration is proposed for 
irregular LDPC codes. Different scaling factor per iteration 
technique has good performance for irregular LDPC codes. 
However, adding these scaling factors requires complex 
calculation steps in designing stage, and requires extra storage 
to store the scaling factor value of each iteration. 

In [7], we proposed simplified variable-scaled min-sum 
(SVS-min-sum) decoding technique. This algorithm uses 
simply implemented heuristic technique to update the scaling 


factor with iterations. It is simpler than both variable scaling 
factor [6] and 2D correction Min-Sum [5] in implementing 
and designing. Simulation results show that SVS-min-sum 
has lower Bit Error Rate (BER) than constant scaling factor 
for many LDPC codes [7]. SVS-min-sum algorithm starts the 
scaling factor sequence with a constant value equals 0.5. This 
restriction decreases its performance and makes it unsuitable 
for some codes. 

In this paper, we introduce a generalization of the SVS- 
min-sum algorithm by removing the restriction of starting 
the scaling sequence from 0.5. This generalization leads to 
better performance than constant scaling for all codes. In 
fact, constant scaling can be seen as a special sub-optimized 
version of the proposed algorithm as shown in section IV. We 
apply Nelder-Mead optimization [8] on DE to jointly optimize 
the initial scaling factor and updating step of the scaling 
sequence. Simulation results illustrate the improvement of the 
proposed algorithm in both BER performance and decoding 
latency over other scaling strategies. 

The rest of the paper is organized as follows: Section II 
presents the necessary background on the SPA, Min-Sum, 
Scaled Min-Sum, Variable Scaled Min-Sum and SVS-min-sum 
algorithms. Section III presents the generalized SVS-min-sum 
algorithm. The simulation results are displayed and discussed 
in Section IV. Finally, the paper is concluded in section V. 


II. REVIEW OF THE SPA AND MIN-SUM BASED 
ALGORITHMS 

An (n,k) LDPC code is a binary code characterized by a 
sparse parity check matrix H G F™ xn where m = n — k. It 
can be represented by a Tanner graph which contains variable 
nodes j G {l..n} and check nodes i G . We denote the 

set of variable nodes connected to a certain check node i as 
V{i}. Furthermore, the set V{i}/j denotes the set of variable 
nodes connected to check node i excluding j. Similarly, the 
set of check nodes connected to a certain variable node j 
is denoted by C{j}. C{j}/i denotes the set of check nodes 
connected to the variable node j excluding i. 

LDPC codes are efficiently decoded by message passing de¬ 
coding algorithms. The main idea behind all message passing 
algorithms is processing the received symbols iteratively in 
concatenated steps that can be seen over the Tanner graph as 
horizontal step followed by vertical step. In this section, we 
review some message passing decoding algorithms that are 
either used for comparision or as the starting point of our 
modified algorithm. 



A. Sum-Product algorithm (SPA) 

One iteration of the tanh-based SPA is described in the 
following steps:- 

1) Initialization step: The LLR of bit number j is 
initialized with its channel LLR ( U c h 5 ). These initial 
values are used as Vj^i , messages from variable 
node j to check nodes i V i G C{j}. 

2) Horizontal step: At each check node i, messages 
Vj^i (which come from variable nodes V{i}) are 
used to calculate the reply messages U^j for all j G 
V{i} by (1). 

Ui^j = 2 x tank -1 ( JJ tanh V ^ 1 j (1) 

3) Vertical step: At each variable node j , messages 
Ui^j are used to calculate the reply messages 

for all i G C{j} by (2). 

Vj —>i = U c hj H - ^ ^ (2) 

i'ec{j}/i 

4) Decision step 

For each variable node j , its LLR is updated by (3) 
LLRj = U chj + U ^i ( 3 ) 

The LLR values are applied to the hard decision to decide on 
the bit value to be 1 if LLRj < 0 and zero otherwise. The 
syndrome is calculated and checked; if it is all-zero vector, 
this word is successfully decoded, otherwise, if the syndrome 
condition is not satisfied, the decoder proceeds to the next 
iteration. This process continues till either the code word is 
successfully decoded or the maximum iterations are exhausted. 


B. Min-Sum algorithm 

The Min-Sum algorithm follows the same steps as SPA. It 
only approximates the horizontal step calculation by minimum 
operation as shown in (4) [3] 
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Fig. 1. Circuit representation of SVS scaling 

TABLE I. Calculation of scaling factor value of each 

ITERATION USING SVS-MIN-SUM ALGORITHM 


Iteration index i 

a 

1 -> S 

0.5 

(S + 1) 2 S 

0.75 

(2 S + 1) -A 3S 

0.875 

(3 S + 1) -A 4S 

0.9375 


D. SVS-min-sum algorithm 

Changing the scaling factor with iteration for irregular 
LDPC codes is used in [6]. Despite of performance enhance¬ 
ment of this variable scaled min-sum algorithm, it requires 
extra storage because we need different scaling factor value 
per iteration, associated with different mutual information into 
passed messages per iteration [6]. The general fractional values 
(taken by the scaling factors) make the multiplication operation 
complex to implement. We proposed in [7] an SVS-min-sum 
algorithm addresses the particular point of simply per-iteration 
updated scaling rule. 

As stated in [6], the scaling factor should increase exponen¬ 
tially with iterations and its final value is 1. So we approximate 
the scaling factor sequence to a stair sequence which is 
updated every S iterations, increase exponentially and easy 
to implement. The variable scaling factor can be calculated 
as: 

a= l- 2 “ ri/s l (6) 


Min-Sum is easier to implement, as it gets rid of the tanh(.) 
calculation. However, the approximation of the tanh(.) to the 
min(.) leads to some loss of performance compared to the tanh- 
based SPA algorithm. This loss of performance is partially 
recovered by Scaled Min-Sum algorithm. 

C. Scaled Min-Sum algorithm 

In order to decrease the gap between the min-sum and the 
tanh-based SPA algorithms, a constant scaling factor (a < 1) 
is applied to the check node updating equation (4). In other 
words, converts the Horizontal step to (5) 

Ui = a TT sign(vjf^,i)* min (5) 

3 

This scaling factor is optimized to maximize the performance 
of Scaled Min-Sum algorithm. 


Where \i/S ] is the first integer greater than or equal to i/S . i 
is the iteration index which takes values {1,2,3...}. By using 
(6), the scaling factor of each iteration can be calculated as 
shown in table I. This sequence is:- 

1) Easy to design, because it requires a single parameter 

S. 

2) Does not need to store a specific scaling sequence for 
each code rate. It only requires to store the optimal 
updating step size S of each code rate. 

3) Easy to implement, because it only requires shifting 
right by \i/S] then subtraction. Number of required 
shifts \i/S] can be stored in a register and increased 
by 1 every S iterations. 

As shown in Fig. 1, the SVS scaling strategy is implemented by 
two sub-circuits. The first sub-circuit calculates the number of 
shifts required in each iteration \i/S ], this number of shifts 




























Fig. 2. Circuit representation of GSVS scaling 

TABLE II. Calculation of scaling factor value of each 

ITERATION USING GSYS-MIN-SUM ALGORITHM 


Iteration index i 

a 

1 S 

no 

(S + 1) —► 2 S 

0.5 + 0.5 * ao 

(2 S + 1) -A 3S 

0.75 + 0.25 * a 0 

(3 S + 1) -A 4S 

0.875 + 0.125 * a 0 


is calculated by dividing the iteration clock by S (updating 
step) to generate the updating clock of the scaling factor, then 
this updating clock is used to count the required number of 
shifts \i/S] using a counter starts by 1. The other sub-circuit 
multiplies the minimum operation output U' by <r specified in 
(6) by using a Barrel shifter to shift U' right by \i/S ] then 
subtract. 

III. Generalized SVS-min-sum (GSVS-min-sum) 

ALGORITHM 

As shown in table I, SVS-min-sum sequence starts with 0.5 
and increases exponentially with iterations. The limitation of 
starting with a fixed value of 0.5 restricts the performance to 
be sub-optimal. As a solution, we propose a new GSVS-min- 
sum algorithm where the scaling factors sequence is calculated 
by (7): 

a = l-(l-ao)*2“ (ri/sl “i) (7) 

Where <To is the initial scaling factor. By using (7), scaling 
factor of each iteration is calculated as shown in table II, where 
scaling factor values start with <to and increase exponentially 
to unity for large value of iteration index i. 

Circuit representation of GSVS scaling is shown in Fig. 
2. It is similar to SVS scaling circuit, but with two main 
differences: the first difference is that U' is multiplied by 
(1 — (To) before shifting right, this is added to generalize 
the initial scaling factor. (1 — <to) is chosen to be simply 
implemented. We use (1 — <To) to be in the form of 2~ l or 
2 -J +2 -/c , where i , j and k are integer numbers; for example, 
if i = 2 —)► (l-a 0 ) = 0.25, i = 3 —)> (1 - a 0 ) =0.125 and if 
j = 2, k = 3 -A (1 — (To) = 0.375 . The second difference is 
that the counter of required shifts starts with 0 instead of 1 
because GSVS-min-sum requires (\i/S ] — 1) shifts not \i/S] 
as in SVS-min-sum. 

In SVS-min-sum, we only need to optimize the updating 
step size S, however, in GSVS-min-sum we also need to 


optimize the initial scaling factor <ro. To calculate the op¬ 
timal ((To ,S) op t., we use Nelder-Mead optimization Method 
[8] (summarized in III-B) to minimize (E^/No) min, where 
(£)> /No) min is the minimum E^/No required to achieve pre¬ 
specified BER threshold. {E^/No)min of given (a 0 , S') is 
calculated by DE. 

A. Density Evolution (DE) of Min-Sum based algorithms 

DE checks the ability of an LDPC decoder to correctly 
decode messages with specific noise variance. This is done by 
tracing the Probability Density Function (PDF) of messages 
passed between check and variable nodes (using all-zero code¬ 
word). Using of all-zero code-word is valid in Binary Phase 
Shift Keying (BPSK) because the LLR of both 1 and 0 has 
a similar PDF shape, but this is not valid in Quadrature 
Amplitude Modulation (QAM) signaling [9]. 

DE is used in [2] to obtain the optimal weight distribution of 
irregular LDPC codes, and is used in [4] [5] to calculate the 
optimal scaling factor(s) of LDPC decoder. We use the same 
DE as in [5] after changing the PDF of channel LLR so that 
we can use it with QAM signaling. 

1) Channel LLR \s PDF of BPSK over an AWGN channel: 
In BPSK, all-zero code-word’s bits 14 = 0 are modulated 
to Xk = 1 — 214 = 1- Then Xk is transmitted over an 
Additive White Gaussian Noise (AWGN) channel with noise 
variance cr 2 , so the received sequence is = 1 + where 
rik is normally distributed random variable with mean=0 and 
variance=cr 2 . Therefore, the channel LLR ( U c h k =2 xy^/a 2 ) 
is normally distributed random variable with mean=2 /a 2 and 
variance=4/cr 2 . The PDF of U c h k is used as the initial PDF of 
variable nodes’ messages to check nodes. 

2 ) Channel LLR’s PDF of higher order QAM constellation 
over an AWGN channel: As shown in [9], for higher order 
constellations, we cannot assume that all-zero code-word was 
transmitted. Therefore, we used a similar procedure to [9], 
where authors modified the definition of bit’s LLR to be: ”LLR 
of receiving the same bit value as was transmitted ” instead of 
”LLR of receiving 0 ”. This is equivalent to replacing U c h by 
Uy, where Ut = U chk x (1-214). So U+ hh will be positive 
if and only if U c h k has the same sign as given by 14 • 

Firstly, for any constellation point W, we calculate the PDF 
of U c h fo r bit number l into W given that W is transmitted 
fu h ( u i/W) [10]. Then we calculate the average PDF of U+ h 
by (8). 

Io 92 {M )/2 

%>) = - E [Y. tu ch {ui=u/W) 

1 1=1 k wez l 

+ E fu ch (ui =-U/W)) (8) 

weo t ' 

Where l is the bit position index. Only half of bit positions 
were used, because of symmetry between real and imaginary 
axes of QAM signaling. Zi is the set of W where bit position 
l contains 0. Oi is the set of W where bit position l contains 
1. r] is PDF correction factor used to ensure that area under 
the PDF=1. 

In other words, calculate the average PDF of U c h for zeros 
and —U c h for ones. Then use this PDF as the initial PDF of 
variable nodes’ messages. 




























Fig. 3. Flow chart of Nelder-Mead method 

B. Nelder-Mead (NM) optimization method 

NM optimization [8] of (<to,5) is based on constructing 
a simplex (polygon) of 2+1=3 random solution points {Xi = 
(ao,S)i\i = 1,2 or 3} for our 2-dimension problem. After 
calculating {Eb/No) min of these three points, the worst point 
is replaced by a better point as described in the flow chart 
in Fig.3. This procedure is repeated until the simplex shrink 
enough to the optimal solution. 

We use Nelder-Mead method for many reasons: first, it 
does not need the mathematical derivative of the cost func¬ 
tion (which is not available because our cost function is 
(Eb/No)min which comes from DE). Second, Nelder-Mead 
is faster in convergence than other heuristic methods. Finally, 
our cost function has only one minimum, so the algorithm 
does not get trapped in a local minimum. To prove the last 
claim that (Eb/No) min has only one minimum, we calculated 
it for all possible combinations of (<to,5) for short length 
LDPC code with rate 0.5 specified in DVB-T2. Fig.4 shows 
that (Eb/No) m i n has only one minimum. Similar results 
are obtained for all tested codes. Optimization of (<to, 5) is 
calculated offline for each LDPC code rate, then the optimal 
value is used to implement the decoding circuit. 

IV. Simulation environment and results 

For simulation, we used (16200, 7200) eIRA LDPC code 
specified in DVB-T2 standard [11], [12]. Data are produced 
as binary bits modulated using the challenging 256-QAM 
modulation scheme and sent over an AWGN channel. The sim¬ 
ulations are performed using MATLAB platform. Maximum 
number of iterations is set to 40 iterations. 

Fig.5 shows the WER of LLR-SPA, SYS-min-sum with 



Initial alpha x 16 


Fig. 4. contours of (Eb/ V))min for short code with rate=0.5 



Fig. 5. WER of (16200, 7200) LDPC over 256QAM with different decoding 
algorithms 

S = 10 [7], Scaled Min-Sum with a = 15/16 (optimized by 
DE and the same as in [7]), GSVS-min-sum with (cto = 0.75 
and 5 = 9) (optimized by DE with Nelder-Mead method) and 
2D correction min-sum; where the output of the check nodes 
with degree 4,5,6 and 7 is multiplied by 0.94, 0.92, 0.88 and 
0.86 respectively, and the output of the variable nodes with 
degree 1,2,3 and 8 is multiplied by 1.00, 1.00, 0.91 and 0.83 
respectively [5]. Results in Fig.5 show that: 

• GSVS-min-sum has better performance than SVS- 
min-sum by 0.5 dB at WER= 10 -3 , this indicates the 
importance of the proposed algorithm, which jointly 
optimizes the initial scaling factor a 0 with the updat¬ 
ing step size S. 

• Although 2D correction min-sum has an excellent 
performance after many iterations (200 iterations) [5], 
it has higher WER than GSVS-min-sum algorithm 
after 40 iterations for the whole simulation range and 
higher than scaled-min-sum for high Eb/No range. 
The gap between GSVS-min-sum and 2D correction 
is 0.3 dB at WER= 1(T 3 . 

• There is a small gap between GSVS-min-sum and 
LLR-SPA performances (0.1 dB for low Eb/No and 
nearly disappears at high Eb/No), with much lower 
implementation complexity. 

• For the scaled min-sum algorithm, Optimizing the 
scaling factor of each code rate increases its per¬ 
formance specially for high Eb/No. This concept is 
illustrated in [7] by showing that each code rate of 
DVB-T2 LDPC codes has different optimal scaling 
factor. 

Fig.6 shows clearly that GSVS-min-sum not only has lower 

























































Fig. 6. Average number of iterations of (16200, 7200) LDPC over 256QAM 
with different decoding algorithms 

WER than other min-sum based algorithms, but also it has 
the lowest average number of iterations which leads to lower 
latency and higher average throughput. 

For more results, we used three different rates of the 
LDPC codes specified in DVB-T2 standard with BPSK, 
these codes are (16200,7200) short code with nominal rate 
= 0.5, (16200,11880) short code with nominal rate = 0.75 
and (64800,48600) normal code with rate = 0.75. WER of 
these codes with constant scaled-min-sum, SVS-min-sum and 
GSVS-min-sum decoding algorithm are shown in Fig. 7. Sim¬ 
ulation results show that GSVS-min-sum has the lowest WER 
among the three decoding algorithms, even though scaled min- 
sum has lower WER than SVS-min-sum or not. Note that: 
constant scaling (in scaled min-sum) and SVS-min-sum are 
special cases of the GSVS-min-sum where S= number of 
iterations for scaled min-sum and ao = 0.5 for SVS-min- 
sum. So GSVS-min-sum, which has optimized values for both 
ao and S, has the best performance between them as shown 
in Fig. 7. The parameters of the three decoding algorithms 
are shown in table III. For (16200,11880) code, GSVS-min- 
sum has the same performance as constant scaling factor and 
better performance than SVS-min-sum. The poor performance 
of SVS-min-sum comes from the limitation of starting by 0.5 
which is away from the optimal scaling sequence. For the other 
codes, GSVS-min-sum has better performance than both SVS- 
min-sum and constant scaled min-sum. 

V. CONCLUSION AND FUTURE WORK 

In this paper, we generalized the SVS-min-sum decoder by 
allowing it to start with any initial scaling factor ao . Simulation 
results indicated the superior performance and lower latency of 
GSVS-min-sum decoder to other min-sum based algorithms. In 
addition, GSVS-min-sum algorithm performance is very close 
to the LLR-SPA with much lower complexity. Moreover, the 
proposed algorthim is still simpler to implement than both 
the variable scaling factor in [6] and the 2D correction Min- 
Sum in [5]. As future work, we will apply our GSVS-min-sum 
decoding algorithm to layered LDPC codes implementation. 
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TABLE III. Optimal parameters of different LDPC decoding 
ALGORITHMS FOR BPSK 



constant a 

step of SVS 

(a 0 ,S) of GSVS 

(16200,7200) 

0.9375 

5 

(0.75,9) 

(16200,11880) 

0.875 

10 

(0.75, 16) 

(64800,48600) 

0.875 

10 

(0.75, 18) 



Fig. 7. WER of different LDPC cods over BPSK 
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